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fact  it  possesses  Minque  optimality  for  a particular  choice  of  norm,  but  also 
various  other  optimality  properties  and  necessary  and  sufficient  conditions  for 
estimability  associated  with  Minque  simplify  considerably.  .Moreover  we  are  able 
to  derive  sufficient  conditions  for  consistency  which  also  provide  estimability 
conditions  of  a simpler  structure.  The  consistency  of  cur  estimators  makes  them 
convenient  as  starting  points  for  a single  cycle  to  obtain  asymptotically 
fully  efficient  estimates. 


A SIMPLE  'SYNTHESIS '-BASED  METHOD  OF  VARIANCE  COMPONENT  ESTIMATION 


by 

H.  0.  Hartley*,  J.  N.  K.  Rao^  and  Lynn  LaMotte^ 

1,  Introduction 

In  this  paper  we  do  not  attempt  an  evaluation  of  the  ever  growing  methodology 
in  the  estimation  of  variance  components.  (For  an  excellent  summary  of  the  literature 
up  to  1971  see  Searle  (1971).)  Optimality  properties  are  sometimes  achieved  at  consi- 
derable computational  efforts.  A case  in  point  is  the  M.  L.  estimation  (see  Hartley  & 

Rao  (1967))  which  is  still  fairly  laborious  for  large  data  banks  in  spite  of  the  improve- 
ments through  the  W- transformation  (Hemmerle  & Hartley  (1973)).  Similar  observations 
apply  to  the  general  case  of  Minque  (C.  R.  Rao  (1971))  recently  simplified  by  Liu  & 
Senturla  (1976).  Other  methods,  such  as  the  Henderson  3 Method  (Henderson  (1953))  or 
the  Abbreviated  Doolittle  and  square  root  method  (see  e.g.  Gaylor,  Lucas  and  Anderson 
(1970))  depend  on  a subjective  ordering  of  the  components  (such  as  with  the  Forward 
Doolittle  procedure)  and  if  the  ordering  is  unfortunate  the  method  may  fail  to  yield 
estimates  for  certain  components  while  with  a different  ordering  (not  attempted)  all 
components  nay  well  be  estimable.  The  work  Involved  in  attempting  all  possible  order- 
ings of  the  variance  components  is  usually  prohibitive.  The  present  method  achieves 
optimality  properties  and  is  nevertheless  computationally  simple.  In  fact  it  possesses 
Minque  optimality  for  a particular  choice  of  norm,  but  also  various  other  optimality 
properties  and  necessary  and  sufficient  conditions  for  estlmabllity  associated  with 
Minque  simplify  considerably  (see  Section  6).  Moreover  we  are  able  to  derive  suffi- 
cient conditions  for  consistency  which  also  provide  estlmabllity  conditions  of  a simpler 
structure  (see  Appendix).  The  consistency  of  our  estimators  makes  them  convenient  as 
starting  points  for  a single  ML  cycle  to  obtain  asymptotically  fully  efficient  estimates. 

2.  The  Mixed  ANOVA  Model 

Employing  the  currently  used  .notation  we  write  the  mixed  ANOVA  model  in  the  forn< 
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c+1 

y - Xa  + Z U b (1) 

1-1  ^ ^ 

where 


y Is  an  n X 1 vector  of  observations, 

X Is  an  n X k matrix  of  known  coefficients, 
a Is  a k X 1 vector  of  unknown  constants. 

Is  an  n X matrix  of  0,  1 coefficients, 

2 

bj^  Is  an  m^^  X 1 vector  of  normal  variables  from  N(0,  o^)  . 

Specifically  and  b^^^  Is  an  n-vector  of  "error  variables". 

Moreover  the  design  matrices  have  precisely  one  value  of  1 In  each  of 

c 

their  rows  and  all  other  coefficients  0.  We  denote  by  m - r m the 

1=1 


total  number  of  random  levels. . 

We  may  assume  without  loss  of  generality  that 


X'X  = I (2) 

for  If  (2)  Is  not  satisfied  we  may  orthogonallze  X by  a Gram  Schmidt 

orthogonallzatlon  process  with  a consequential  reparameterization  of  a 

omitting  any  linearly  dependent  columns  In  the  Gram  Schmidt  process. 

Usually  the  first  column  of  X Is  the  column  vector  with  all  elements  = 

l//n.  It  is  the  objective  of  the  method  to  compute  estimates  of  the 

2 

variance  components  and  the  vector  a. 


3.  The  Present  Method 


The  essence  of  the  present  method  Is  to 

(a)  Select  c+1  quadratic  forms  Qj(y)  In  the  elements  of  y. 

(b)  Use  the  method  of  synthesis  (Hartley  (1967),  Rao  (1968))  to 
obtain  the  coefficients  k^^^  In  the  formulas  for  E(Qj)  In  the 


;:ta  SicUta 
.n  Bactto 


in 


form 


BY 


^ 

□ □ 
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E(QJ  - I . 


(c)  Estimate  by  equating  the  computed  to  their  expectations 

by  Inverting  the  system  (3)  to  compute  the  vector  o with 
“2 

elements  o. 


from  the  vector  jg(y)  with  elements  Qj(y)  where  K •»  (kj^)  with 

rank  to  be  discussed  In  Section  6 and  7. 

*2 

(d)  Replacing  any  negative  elements  of  a by  0,  with  consequences 
to  be  discussed  in  Section  7. 

We  now  give  more  details  for  (a),  (b)  and  (c): 

(a)  The  Qj (y)  will  be  based  on  contrasts  which  do  not  depend  on 

any  elements  of  a.  Accordingly  we  orthogonallze  all  U.  matrices 
on  X and  construct  matrices  orthogonal  on  X as  follojcs:  De- 
note by  u(t,i)  the  tth  column  vector  of  and  by  x(r)  the  rth 
column  vector  of  X then  the  columns  v(t,l)  of  are  given  by 


v(t,l)  - u(t,i)  - E x(r)  {x'(r)u(t,l)} 
r-i 


“ u,  - xx-u,  . J 

We  now  choose  the  cfl  quadratic  forms  (y)  as 

Qj(,)  -y'V^Vjy.  (V',)’Vjy  J-1 c^X  (6: 

(b)  It  follows  from  the  method  of  synthesis  (see  Hartley  (1967), 
J.  N.  K.  Rao  (1968))  that 


E Q,(y)  - I 


- Z (Vj  u(t,l))'(Vj  u(t,i)) 

Now  since  v(T,j)  Is  orthogonal  on  any  x(p)  (i.£.  since 
v'(T,j)x(p)  = 0)  we  can  write  the  k^^  in  the  alternative  form 

kji  - Z (Vj  v(t,i))'(V^v(t,i)) 

(8) 

- ZZ  {v'(T,j)  v(t,l)}^ 
tT 

showing  that  k^j  = k^^j^  . 

An  alternative  form  of  k^^  is 

kj^  - tr{(VjVp(V^V^)}  . (9) 

We  shall  show  in  Section  6 that  the  symmetrical  matrix 
K *■  will  have  full  rank  c+1  if  the  n x n matrices 

are  not  linearly  dependent. 

(c)  We  shall  also  show  in  Section  6 that  the  system  of  equations 

)8  “ (10) 

is  consistent  even  if  the  rank  of  K is  degenerate.  Solving 
(10)  in  the  form 

- k"  Q (11) 

% % 

we  shall,  of  course,  be  particularly  interested  in  the  full 
rank  case  when  K ■ K 


4.  Thu  Comput.itional  Load 

It  may  be  helpful  to  give  an  idea  of  the  computational  efficiency  of 


the  present  method  by  tabulating  the  nuni>cr  of  products  involved  In  the 
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nain  operations  of  the  algorithm.  To  this  end  we  first  note  simplified 
versions  for  the  k . , . : Observing  that  U ■ I we  have  from  (5)  that 

C+1 » 1 CTl 

V - I - XX'  and  since  X’X  - I we  find  that  V - I - XX'  and 

c+1  c+1  c+1 

finally  from  (9)  that 


- tr  (I  - XX') (I  - XX')  - tr  (I  - XX') 
“ n - k . 


(12) 


Similarly  we  find  that 


’^C+l.i  - - XX')(V^v;)} 

- tr  {V^V^  - XX'V^Vj} 


- tr  . 

Further  we  note  the  form  of 


(13) 


v;^iy  - y - XX'y 


(lA) 


Defining  now  the  adjoined  matrices 


U - (Ui  |...|  U^)  V-(Vj...jV^)  (15) 

the  bulk  of  the  work  consists  of  the  formation  of  the  elements  of  the 

symmetrical  matrix  V'V  = V'U  * U'V.  The  elements  of  this  matrix  are 
assembled  in  submatrices  in  accordance  with  the  partition  (15)  as  shown 
in  the  Schedule  1 below  where  it  must  be  remembered  that  the  range  of 
the  column  index  t depends  on  i and  is  t “ 1,  . . . , m^  and  the  range  of 
T ■ 1»  . . . , so  that  the  submatrix  has  dimensions  m^  x m^.  Tlie 

for  1 i J “ 1 , . . . , .c  are  then  obtained  by  forming  the  sums  of 
squares  of  the  elements  in  each  submatrix  in  accordance  with  (7). 

Finally,  we  recite  the  formulas  for  the  remaining  coefficients  in 
the  equations  (10).  Tlie  k^^^^  and  k^j^  ^ are  computed  from  (12)  and 


(13)  respectively  and  the  right  hand  sides  of  Qj(y)  from  the  second  form 
In  (6)  for  j » 1,  c while  given  in  accordance  with  (14)  by 

Qc+i(y)  » y’y  - (x’y)’(x*y)  (i6) 

We  can  now  summarize  the  approximate  nunfcer  of  products  involved  in 
the  various  operations  of  the  algorithms. 


Operation 

Approximate  No.  of  Products  Involved 

Orthogonalization  of  X'X 

(l/2)k'^(k'^  - l)n 

where  k"*^  = ^ of  columns  in  original  X 

X'Uj^  i = 1,  ...»  c 

kmn 

X(X'Uj^)  i - 1,  ....  c 

(equation  (5)) 

nn^ 

u'v  - V’V 
(Schedule  1) 

0 

Subtotals  of  elements  of  v(t,l) 

‘‘ij  ^ ® 

(equation  (7)) 

(l/2)m(iiri-l) 

i - 1.  •••*  c 
(equation  (13)) 

nn 

Operation 


Approximate  No.  of  Products  Involved 


*^c+l,c+l 
(equation  (12)) 

0 

Qj(y)  j = 1«  •••*  c+ 1 

(mfk+l)  (n+1) 

(equations  (6) , 2nd  form 

and  (16)) 

The  important  point  is  that  the  number  of  products  is  only  a linear 

function  of  the  number  of  data  lines  n.  An  approximate  formula  for  the 

14'  + 

total  number  of  products  is  n{^  k (k  - 1)  + (2m+l)(k+l)} 

5.  A Numerical  Example 

A small  numerical  example  with  n ■=•  4,  k^  ■ 3,  k ■ 2,  c ■ 1,  m^^  « 2, 
m“2,  m2“n=4i8  shown  in  schedule  2 below. 


Schedule  2;  A Numerical  Example  of  a Mixed  Model 


y 

X Original 

«2 

X 

new 

4 

1 

1 

0 

1 

0 

1 

0 

0 

0 

(1/2) 

(1/2) 

4(1/2) 

-(1/2) 

2 

1 

1 

0 

0 

1 

0 

1 

0 

0 

(1/2) 

(1/2) 

-(1/2) 

4(1/2) 

1 

1 

0 

1 

0 

1 

0 

0 

1 

0 

(1/2) 

-(1/2) 

0 

0 

*2 

1 

0 

1 

0 

1 

0 

0 

0 

1 

(1/2) 

-(1/2) 

0 

0 

The  orthogonolizatlon  of  X (original)  to  X (new)  follows  the  standard  Gram 
Schmidt  procedure  and  reduces  the  k^  ■■  3 dependent  columns  to  k •=  2 columns 
which  are  orthogonal  and  standardized.  Note  that 

*<2)ncw  " " <l/2)x(l)^^^  and 

x(3)  . . x(l)  - x(2)  must  be  eliminated, 
old  new  new 

Using  now  x(r)  - ^^’^^new  o’^thoBonalize  Uj^  on  X and  compute  (see  (5)) 


anJ  hence 


x’(l)  u(l,l)  -+(1/2),  x'(2)  u(l.l)  -+(1/2) 

v(l,l)  - u(l.l)  - (l/2)x(l)  - (l/2)x(2) 

likewise 

x’(l)  u(2,l)  - (3/2),  x'(2)  u(2,l)  = -(1/2) 

and  hence 

v(l,2)  - u(2,l)  - (3/2)x(l)  + (l/2)x(2)  . 

This  yields  the  matrix  In  schedule  2 which  has  only  one  Independent 
column.  The  elements  of  require  the  computation  of 

v(l,l)'  u(l,l)  - (1/2);  v(l,l)'’ u(2,l)  = v(2,l)’  u(l,l)  - -(1/2) 

and 

v(2,l)'  u(2,l)  = 1/2  with  sum  of  squares  of  - 4(1/2)^  = 1. 

Further  (equation  (12))  k22  - 4 - 2 - 2 and  (equation  (13))  k^2  “ ^^21  ~ 

4(1/2)^  + 4(0)“  = 1 so  that  the  K matrix  is  given  by  K = 2)  • 

Finally,  (equation  (16)) 

Q2(y)  =4^+2^  + 1^  + 2^  - (i  9)^  - (|  3)^  = 25  - ^ = 25  - 22.5  = 2.5 

aid  (equation  (6))  Qj^(y)  = (y  2)^  + (y(-2))^  = 2 . 

«2  *2  *2 
The  solution  of  Q = Ko  therefore  yields  a«  = 1/2,  a,  •=  1.5  . 

6.  Optimality  Properties  and  the  Consistency  of  the  Equations 

The  estimators  described  In  Section  3 may  be  seen  to  be  "best  at  0^  = 0, 

1-1,  ...,  c,  - 1"  as  defined  by  L.  R.  LaMotte  (1973).  Therefore,  the 

consistency  of  equation  (10),  regardless  of  the  rank  of  K,  Is  established  as 

Lemma  4 by  LaMotte  (1973)'.  That  the  estimators  defined  by  (11)  are  "best" 

among  Invariant  quadratic  unbiased  estimators  guarantees  that  they  are  admissible 

In  that  class:  that  Is,  no  other  Invariant  quadratic  unbiased  estimators  have 

uniformly  less  variance  for  all  o.  Further,  as  noted  by  LaMotte  (1973) , the 

estimators  (11)  have  the  property  that  In  any  model  for  which  a uniformly  best 
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estlmator  exists,  (11)  will  be  uniformly  best.  Finally,  It  may  be  seen 
that  the  "synthesis"  estimators  (11)  are  also  MINQUE  as  In  Rao  (1971, 

Section  6)  with  V = I.  No  claim  Is  made  that  this  choice  of  the  norm  has 
any  particular  merits  among  the  rather  general  family  of  the  norms  covered 
by  Mlnque  formulas.  Hovever;  It  appears  to  be  reasonable  to  us  that  In  the 
absence  of  any  theoretical  criteria  for  selection  of  Mlnque  norms  a norm 
leading  to  simple  estimators  may  be  regarded  as  meritorious. 

Following  Section  A5  In  LaMotte-  (1973)  , It  may  be  seen  that  the  rank 
of  K Is  equal  to  the  number  of  linearly  Independent  matrices  among 
1=1,  ...,  c+1.  Thus  a singular  K may  occur  If  the  matrices  are  not 

all  linearly  Independent  or  If  there  exists  (see  (5))  a linear  combination 
of  the  matrices  whose  columns  are  contained  In  the  linear  subspace 

spanned  by  the  columns  of  X.  In  the  first  case  the  singularity  Is  caused 
by  the  design  leading  to  the  matrices,  while  In  the  second  the  singularity 
Is  caused  by  confounding  fixed  and  random  effects.  In  either  case,  (10) 

Is  consistent  but  some  linear  combinations  of  the  variance  components  can 
not  then  be  unblasedly  estimated.  We  should  stress  however  that  other 
special  cases  of  Mlnque  (not  necessarily  Invariant  to  a)  may  also  deserve 
particular  attention. 
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APPENDIX 

“2 

The  Asymptotic  Consistency  of  a 

In  discussing  the  asymptotic  behavior  of  It  Is  of  course  necessary  to 
specify  the  limiting  process  under  which  such  properties  are  supposed  to  hold. 
Clearly  It  is  necessary  for  the  consistent  estimation  of  the  variances  = 

Var  that  the  number  of  elements  m^  in  the  vectors  b^  all  tend  to  <»,  For  the 
identity  matrix  ^c+1  have  = n the  overal  sample  size.  For  the  re- 

maining m^  we  assume  that  their  limiting  behavior  is  related  to  n by 

1-a  1-a. 

Ln  £ Un  ^ (17) 

where  0 ^ < 1 and  L, U are  universal  constants.  More  specifically  we  assume 

that  a = 0 but  o.  > 0 for  i = 1,  ...,  c.  Generalizations  to  situations  in 
c+i  i 

which  = 0 for  several  components  are  under  consideration. 

Denote  now  by 

v(t,  1)  “ number  of  elements  in  u(t,  1)  which  are  1 (18) 


v(t,  i;  T,  j)  = number  of  rows  in  which  both  u(t,  i)  and 

u(t,  j)  have  elements  1.  (19) 

Using  these  concepts  we  introduce  the  following  conditions  of  'pseudo  orthogon- 
ality* of  the  u(t,  1)  vectors.  We  assume  that 


u.  u. 

£n  j<  v(t,  1)  £ u n (20) 

(where  I,  u are  universal  constants)  and  that 


11 


v(t,  1;  T,  j)  « o(v(t,  j)) 


(21) 

1 y j with  1*1,  c + 1 

and  j “ 1 , ....  c 


The  relationship  between  (17)  and  (20)  Is  obvious  since  J]  v(t,  1)  = n so  that 

t-1 

(20)  Implies  (17)  with  ^ “ T and  L • — and  the  stronger  condition  (20)  Implies 

x>  u 

a uniform  order  of  magnitude  for  all  v(t,  1)  In  a given  Uj^.  Since  the  columns 

of  the  matrices  are  orthogonal  we  have  v(t,  1;  t,  1)  * 0 for  all  pairs  t ^ t- 

For  columns  u(t,  1),  u(t,  j)  with  1 j condition  (21)  Is  satisfied  If  there  Is 

an  asymptotically  uniform  distribution  of  the  v(t,  1)  rows  for  which  u(t,  1)  has 

elements  1 over  a fraction  qm^  of  the  m^  columns  of  Uj  where  0 < q < 1 since  the 

fraction  of  v(t,  1)  which  gives  rise  to  v(t,  1;  t,  j)  will  be  0(q  ^m  = 
a -1  J 

0(n  ) and  will  tend  to  zero. 

Next  we  must  Introduce  conditions  on  the  orthogonal  standardized  matrix  X 

with  elements  x . Denote  by  Y x^  the  sum  of  x^  over  those  rows  for  which 

s(t,l) 

u(t,  1)  has  a 1 element  then  we  assume  that 


a.-l 

I > 


8(t,l) 


“l 

Since  ^ x^  “1  and  the  number  of  terms  In  ^ Is  v(t,  1)  * 0(n  ) condition 

8 S(t,l) 

(23)  Implies  that  asymptotically  the  x^^  have  a uniform  density  x^^  » 0(n  ^). 

Finally  we  place  on  record  a consequence  of  conditions  (18)  to  (23):  It 
follows  from  (5)  using  (18),  (19),  (23)  and  Schwartz*  Inequality  that 
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u’(t,  1)  v(t,  j)  - 


2o  -1 

v(t,  1)  + 0(n  ^ ) for  t - T,  i a J 

20^-1 

. 0 + 0(n  ) for  t y T,  i “ J (24) 

v(t,  1;  T,  j)  + 0(n  ■’  ) for  1 y j 


We  now  turn  to  the  asymptotic  behavior  of  the  and  . From  (8),  (17),  (20), 
and  (25)  we  have  that 


(u'(t.  i)  V(T,  i))2 

T-1 


■ I 1 ^u'(t,  i)  v(t,  i)  ^ ^u’(t,  i)  v(t,  i)  ^ 

1-0  +2a.  2*-2o  +4a  -2 

^ Const  n + 0(n  ) 


(25) 


> C n 


for  all  i “ 1,  , c + 1 


From  (8),  (17),  (19),  (21)  and  (24)  we  have  for  iyj;i«l,  ...,  c+1; 
j “ 1»  •••»  c 


"i  "j  r "X' 

44  “ I Z ^ U'(t.  1)  V(t,  J)  / 

J t-1  T-l  •' 


m 


n n. 

1 J 


i J a.+a  -1 

- Z r v(t,  i;  T,  j)2  + o(n  ^ J ) Z Z v(t,  i;  t, 


j) 


t T 


t T 


2o.+2o-2 
+ ZZ  0(n  * J 

t T 


) 


"i  «4+“ri 

- I o(v(t,  D)  I v(t,  i;  T,  J)  + 0(n  ^ 


)n 


(26) 


2-o.-a.  2o.+2a.-2 

+ 0(n  ^ J)  0(n  ^ J ) 


r 
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1+a 

o(n  + 0(n  - o(n  ) 


(26) 


1+a. 


since  Oj  < 1.  Similarly  we  prove  by  symmetry  that  -0(0  for  1 ^ j £ c 


From  (25)  and  (26)  it  is  clear  that  for  all  large  n the  c x c matrix  k..  for 


Ij 


1+0, 


1,  j •=  1,  ....  c is  asymptotically  diagonal  with  diagonal  coefficients  cn  ^ 
while  the  coefficients  k^^^  ^ are  asymptotically  equal  to  o(n) . Moreover  it  i.*; 
obvious  from  (12)  that  k^^j^  > Cn.  Using  therefore  the  first  c equations 

of  Ko^  = 9(y)  we  obtain  that 


- 0(n  ^ ) {Q^(y)  - o(n)o2^^j^}  - 0(n  ^ )Q^(y)  + o(n 


for  1 - 1 , . . . , c 


(27) 


Substituting  (27)  In  the  last  equation  we  obtain 


(28) 


or 


®c+l  “ o(n  ^ ) 


(2«) 


Substituting  (29)  back  in  (27)  we  obtain 


-a  -1  -1-0. 

j2  - 0(n  ) Q^(y)  + o(n  ^)  Qjj^i(y) 


(30) 


Equations  (29)  and  (30)  show  that  o^  Is  estimable  from  the  Qj^(y).  They  also 
show  that  0^  Is  consistent  provided  we  can  show  that 
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2a  +2 

Var  Q^(y)  - o(n  ^ ) 


Var  "0(11) 


for  r - 1,  . . . , c 


(31) 


since  CovQ^(y)Qj(y)  = 0(VarQ^(y)**  VarOj(y)**). 

In  order  to  prove  the  first  result  in  (31)  we  use  formulas  [22],  [32]  , 

[33]  and  [34]  of  J.N.K.  Rao  (1968)  with  slightly  altered  notation.  Formula  [22] 
gives  E Q|(y)  in  the  form 


c+1 


c+1 


c+1 


E(Q  (y)2)  = 2 n c 02^2  + I c a;[  + I h p (32) 

i<j=l  ^ J ■ i=i  1 i.i  1 ^1 


where  = E are  the  4^^^  moments  of  the  elements  of  b^.  Noting  that 

Var  Q^(y)  •=  E Q^(y)2  - E2(Q^(y))  the  leading  terms  of  c^^  and  c^j  given  by 
J.N.K.  Rao's  equations  [33]  and  [32]  cancel  and  we  are  left  to  consider  the  orders 
of  magnitude  of 

“l 

Cii-2hi  - I I^{Qj.(u(t,  i)  + u(t,  i))  - Q^(u(t,  D)  - Q^(u(t,  1)}2 


in. 


(33) 


1 r 

- I I I I 2(u{t,  1)'  v(s,  r))  (u(t,  D*  v(s,  r))}2  . 

t<T"l  S"! 


Consider  first  the  case  r » 1.  We  distinguish  two  terms  when  s = t and  s • t. 

For  those  two  terms  (u(t,  i) ' v(s,  i))  (u(t,  1)'  v(s,  1))  is  from  (24)  of  the 

o^  2a. -1  3a,~l  ”r 

order  of  magnitude  0(n  ) 0(n  ) ■ 0(n  ).  For  the  remaining  terms  in  ^ 

8"1 

4aj“2 

the  product  is  of  the  order  0(n  ) but  the  number  of  terms  is  of  the  order 
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1-0.  6o.-2  ^ . 2-2o.  6o  -2 

0(n  ) 80  that  Is  0(n  ) and  hence  - 0(n  ) 0(n  ^ ) 


4o.  2aj^+2 

0(n  ■ o(n  ) since  a^<  1. 


Consider  next  the  case  r i and  r y c + 1.  We  have  from  (33)  and  (24) 


m. 


• r 2o  +2a  -2 

c..  = Z i (Z  (^(t,  1;  s,  r)  v(t,  i;  s,  r)  + 0(n  ^ ^ ) 


t<T  s 


0 +0  -1 

+ 0(n  ^ ) (v(t,  1;  s,  t)  + v(t,  1;  s,  r))}^ 


m 


1 

t<T 


I I {o(v(s,  r))  ^ v(t,  1;  s,  r)  + 0(n  ^ ) 


2oi+o^-l 


+ 0(n“^  ) (v(t,  1)  + v(t.  1))}2 


(34) 


m. 


1 o +o 

“ I I {o(n  ’^  ^)  + 0(n 

t<T 


2a^+a  -1 


)}- 


2+20 


o.+2o  +1  2o_,+2o 


o(n  ’^)  + o(n  ^ *^  )+  0(n  ^ 


2+20 

- o(n  . 


The  case  ri<l,  r“c+l  follows  on  the  same  lines  as  (34)  except  that  o^  » 0 

and  that  v(t,  i;  s,  c+1)  v(t,  i;  s,  c+1)  - 0 since  u(s,  r)  has  a 1 only  in  the 

8^**  row  and  either  u(t,  i)  or  u(t,  l)  have  a zero  in  that  row.  The  order  of 

2o  -1  ^ 20 

magnitude  of  O will  therefore  be  0(n  ) and  will  be  0(n  *■)  * o(n^). 

The  treatment  of  the  c^^  in  J.N.K.  Rao's  formula  [33]  follows  on  similar 
lines  to  the  above  proof  for  the  c^^  if  of  the  two  alternatives  i < j , j < i 
in  (21)  the  smaller  o^,  o^  is  selected  for  majorisatlons. 
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It  remains  Co  consider  Che  terms 


m B 

.1  _r 


h.  ■ Z Q^(u(t,  D)  - Z { Z (u'(t,  1)  v(s,  r))^}^ 
t=l  ^ t-1  8-1 


(35) 


For  the  case  r - 1 we  have  using  (24) 


n. 


1 1 

h -Z  Uu'(t,  1)  v(t,  1))2  +Z  (u’(t,  1)  v(s,  1))^}^ 

t-1  8t*t 


1 2a  3a  -1 

- Z tO(n  ^)  + 0(n  ^ )}2 

t-1 


(36) 


l+3a  4a  5a  -1 

0(n  ^)  + 0(n  ^)  + 0(n  ^ ) 


2a  +2  2a  +2 

o(n  ) - o(n  ^ ) 


for  1 - r 1*  c + 1, 


- o(n^) 


for  1 - r - c + 1. 


For  the  case  1 ^ r and  r c + 1 


m m 
1 r 


a 40  -1 

h.  - Z < Z (v(t,  1;  s,  r)  4 0(n  ‘ ))2}2 

t-1  8-1 


m^  m 
1 r 


o^+a  -1 

" Z t Z r))v(t,  1;  8,  r)  4 0(n  )J  v(t,  1;  s,  r) 

t-1  8-1  8 


1-0  20  42“  -2 

4 0(n  0(n  ^ ))2 


"l  o.4o  2o  4a  -1 

- I {o(n  *^)  4 0(n  ^ )}^ 

t-1 


o.42o  41  2o  42o  3o  42o  -1 

- o(n  ^ '^  ) 4 o(n  ^ .4  0(n  ^ ' ) 


(37) 
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2+2a 

- o(n  . • 


(37) 


Finally  for  r=«c  + l,  l^rwe  have 


1 n 2 

h - ^ { )!  (v(t,  i;  s,  r)  + 0(n  ))2} 

t-1  s 1 


”l  2a  -1  2 

“I  ( Z ''(t,  i;  8,  r)2  + I v(t,  i;  s,  r)  0(n  ) + 0(n  )} 

C-1  8 8 


(38) 


Hw  since  v(t,  1;  s,  c + 1)  Is  either  0 or  1 we  have  that  J]  v(t,  1;  s,  c + 1)^  « 

s 

X v(t,  1;  s,  c + 1)  - v(t,  1)  so  that 

8 


(39) 


Since  l8  unbiassed  and  Cov  (o^)  -►  0 as  n " It  follows  that  is 
consistent.  Moreover  If  we  replace  any  negative  o^  by  0 the  resulting  statistic 
say  has  a smaller  mean  square  error  and  hence  Is  also  consistent. 


I 
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The  consistent  estimator  3 may  serve  as  a starting  value  for  the 

Iterative  maximum  likelihood  estimation  procedure  described  by  Henmerle 

and  Hartley  (1973).  Under  certain  regularity  conditions  (not  discussed 

here)  one  single  cycle  of  the  Iteration  will  result  In  asymptotically 

2 

efficient  estimators  of  o and  a.  If  the  Iteration  Is  carried  to  convergence 
solutions  of  the  ML  equations  are  reached.  If  no  ML  cycles  are  performed 
a consistent  estimator  3 of  a can  be  computed  from  the  generalized  least 
squares  (ML)  equations. 


a - (x'h~^x)”^(x’h"V) 


c 3. 


where  H » I + E 
n 


1-1  3 


“ ^1^1 
c+1 


(40) 


It  has  been  shown  by  Hemmerle  and  Hartley  (1973)  that  (40)  can  be  computed 
directly  from  the  U^Uj^  and  X'U^  matrices  without  the  Inversion  of  the 
n X n matrix  H using  their  so  called  W transformation.  In  fact  the 
matrix  (their  equation  (19))  is  essentially  given  by  the  matrices 

(see  the  above  Schedule  1)  and  by  the  contrasts  V^y  required  in  the  computation 

of  Qj(y). 

The  variance  covariance  matrix  of  3 can  likewise  be  computed  through 
the  W transformation. 
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